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DETAILED ACTION 

1 . The amendment received March 5, 2008 in response to the Office Action of July 
10, 2007 has been received and is entered. 

2. Upon review and reconsideration, the finality of the previous action will be 
vacated and prosecution of application 10/771,620 is reopened. 

3. Claims 1, 3, 8, and 9 have been amended, claims 2, 4-7, 10-65 have been 
cancelled, and new claims 66-69, 71, and 72 have been added. 

4. Claims 1, 3, 8, 9, 66-69, 71, and 72 are pending and under consideration. 

Rejections Maintained 
Claim Rejections - 35 USC § 112 
The following is a quotation of the first paragraph of 35 U.S.C. 1 12: 

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

5. Claims 1, 3, 8, and 9 remain rejected and claims 66-69, 71, and 72 are rejected 
under 35 U.S.C. 1 12, first paragraph, for the reasons previously set forth in section 5, pages 4-6 
of the Office Action of July 10, 2007. 

Applicants argue that the claims are in condition for allowance. 

Applicants arguments have been considered, but have not been found persuasive upon 
reconsideration as the claims are not limited to detecting SEQ ID NO: 3 or SEQ ID NO: 4 or the 
full-length complement of either as the alternative language of claim 1 still reads on the variants 
encompassed by FLJ20174. Thus, the claims remain rejected for the reasons previously set forth. 
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It is noted that amendment of the claims to assaying the level of FLJ20174 nucleic acid 
selected from the group consisting of SEQ ID NO: 3 or the full length complement thereof or 
SEQ ID NO: 4 complement thereof or the full length or removal of the reference to FLJ20174 
would obviate this rejection. 

6. Claims 1, 3, 8, and 9 remain rejected and claims 66-69, 71, and 72 are rejected 
under 35 U.S.C. 1 12, first paragraph, as failing to comply with the written description 
requirement for the reasons previously set forth in section 6, pages 6-7 of the Office Action of 
July 10, 2007. 

Applicants argue that the claims are in condition for allowance. 

Applicants arguments have been considered, but have not been found persuasive upon 
reconsideration as the claims are not limited to detecting SEQ ID NO: 3 or SEQ ID NO: 4 or the 
full-length complement of either as the alternative language of claim 1 still reads on the variants 
encompassed by FLJ20174. Thus, the claims remain rejected for the reasons previously set forth. 

It is noted that amendment of the claims to assaying the level of FLJ20174 nucleic acid 
selected from the group consisting of SEQ ID NO: 3 or the full length complement thereof or 
SEQ ID NO: 4 complement thereof or the full length or removal of the reference to FLJ20174 
would obviate this rejection. 

New Grounds of Rejection 
Claim Rejections - 35 USC § 112 

The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 
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7. Claims 71 and 72 are rejected under 35 U.S.C. 1 12, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. 

Claim 71 recites the limitation "the probe" in claim 1 . There is insufficient antecedent 
basis for this limitation in the claim. 

Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 351(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published nuclei' Article 21(2) of such treaty in the Hnglish language. 

8. Claims 1, 3, 9, 66-69, 71, and 72 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Tan et al (US Patent App. Publication 2005/0170351 Al, Feb. 20, 2003) as 
evidenced by Appendix 1 and Affymetrix (GeneChip* Data Sheet Human Genome Arrays, 2003, 
p. 1-4). 

The claims are drawn to: 

1 . A method of screening for breast cancer in a subject, the method comprising 
obtaining a sample containing breast cells from the subject, assaying the level of 
FLJ20174 nucleic acid, SEQ ID NO:3 or SEQ ID NO:4 or the full length complement of 
either thereof, comparing the level of FLJ20174 nucleic acid, SEQ ID NO:3 or SEQ ID 
NO:4 or the full length complement of either thereof, in the sample from the subject with 
the level of FLJ20174 nucleic acid, SEQ ID NO:3 or SEQ ID NO:4 or the full length 
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complement of either thereof, in one or more control samples from one or more non- 
cancerous breast tissues, wherein an increase of at least two-fold in the level of 
FLJ20174, SEQ ID NO:3 or SEQ ID NO:4 or the full length complement of either 
thereof, in the subject sample compared to the control samples is indicative of breast 
cancer in the subject and wherein assaying the level comprises an amplification step. 
3. The method of claim 1, wherein the one or more control breast tissue samples from a 
non-cancerous breast tissue are also from the subject. 

9. The method of claim 1, wherein the level of FLJ20174, SEQ ID NO:3 or SEQ ID 
NO:4 or the full length complement thereof is determined by assaying the sample with a 
probe or primer consisting of 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20,21,22,23,24, 25, 
26, 27, 28, 29, 30 or more contiguous nucleotides of SEQ ID NO:3 or SEQ ID NO:4, or 
the full length complement thereof. 

66. The method of claim 1, wherein the step of assaying comprises a polymerase 
chain reaction step. 

67. The method of claim 1, wherein the step of assaying comprises a reverse 
transcriptase polymerase chain reaction step. 

68. The method of claim 1 , wherein the step of assaying comprises a DNA to DNA 
hybridization step. 

69. The method of claim 1 , wherein the step of assaying comprises a DNA to RNA 
hybridization step. 

71. The method of claim 1, wherein the probe is affixed to a solid support. 
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72. The method of claim 71, wherein the solid support is a membrane, a microtiter plate, 
or a polystyrene bead. 

Tan et al. teach the identification of FLJ20174 nucleic acid by microarray analysis using 
Affymetrix U133A Genechips of breast tumors samples as a gene expressed in ER negative 
type 2 tumors, where its expression is changed at least two fold compared to other samples 
including normal samples, see para. 0198-0224 and Table ER-subtype II. Tan et al. teach that 
all tumor and normal tissues were simultaneously harvested during surgical excision of the 
tumor, see 0124. Tan et al. teach that the RNA used for hybridization to the cDNA microarrays 
was generated from RNA of the samples after a single linear amplification step, see 0126. Tan 
et al. teach that PCR can be used to amplify mRNA with the prior conversion of the RNA to 
cDNA, which is reverse transcription, for the PCR amplification, which is well known in the art 
to comprise DNA to DNA hybridization, for detection of the polynucleotides of the invention, 
see para.0030, 0035, and 005 1-0055. Tan et al. teach detection of the nucleic acids with cDNA 
probes, see para. 005 1 and 0052. Tan et al. teach using probes affixed to solid supports, such as 
membranes and beads for detecting nucleic acids, see para. 0055-0060. 

Given that Tan et al. teach using cDNAs for detecting nucleotides of the invention and 
FLJ20174 is a cDNA identical to SEQ ID NO: 3 (see Appendix 1), Tan et al. teach a probe that 
is more than 30 contiguous nucleotides of SEQ ID NO: 3. 

Additionally, it is noted that the Affymetrix U133A gene chips used by Tan et al. contain 
25-mer oligonucleotide arrays complementary to the polynucleotides of the genes being 
screened, Affymetrix (GeneChip® Data Sheet Human Genome Arrays, 2003, p. 2 and 3). 
Although the Tan et al. reference does not specifically state that the probes of the Affymetrix 
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U133Agene chips contained a probe or primer consisting of 10, 11, 12, 13, 14, 15, 16, 17, 18, 
19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more contiguous nucleotides of SEQ ID NO:3 or 
SEQ ID NO:4, given that FLJ20174 was detected using the Affymetrix U133A gene chips, 
which contain 25-mer oligonucleotide arrays complementary to the sequences being screened, 
the claimed product appears to be the same as the prior art product, absent a showing of 
unobvious differences. The office does not have the facilities and resources to provide the 
factual evidence needed in order to establish that the product of the prior art does not possess 
the same material, structural and functional characteristics of the claimed product in the method 
of claim 9. In the absence of evidence to the contrary, the burden is on the applicant to prove 
that the claimed product in the method of claim 9 is different from those taught by the prior art 
and to establish patentable differences. See In re Best 562F.2d 1252, 195 USPQ 430 (CCPA). 

It is noted that a wherein clause in a method claim is not given weight when it simply 
expresses the intended result of a process step positively recited, MPEP 2111 .04. Given that the 
method of the prior art comprises the same method steps as claimed in the instant invention, 
obtaining a sample containing breast cells from the subject, assaying the level of FLJ20174 
nucleic acid, SEQ ID NO:3 or SEQ ID NO:4 or the full length complement of either thereof, 
comparing the level of FLJ20174 nucleic acid, SEQ ID NO:3 or SEQ ID NO:4 or the full length 
complement of either thereof, in the sample from the subject with the level of FLJ20174 nucleic 
acid, SEQ ID NO:3 or SEQ ID NO:4 or the full length complement of either thereof, in one or 
more control samples from one or more non-cancerous breast tissues and comprising an 
amplification step, the claimed method is anticipated because the method will inherently be a 
method of screening for breast cancer, wherein an increase of at least two-fold in the level of 
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FLJ20174, SEQ ID NO:3 or SEQ ID NO:4 or the full length complement of either thereof, in the 
subject sample compared to the control samples is indicative of breast cancer in the subject. See 
Ex parte Novitski 26 USPQ 1389 (BPAI 1993). Although the reference does not specifically 
state that an increase of at least two-fold in the level of FLJ20174, SEQ ID NO: 3 or SEQ ID 
NO: 4 or the full length complement of either thereof, in the subject sample compared to the 
control samples is indicative of breast cancer in the subject, the claimed method appears to be the 
same as the prior art method, absent a showing of unobvious differences. The office does not 
have the facilities and resources to provide the factual evidence needed in order to establish that 
the method of the prior art docs not possess the same material, structural and functional 
characteristics of the claimed method. In the absence of evidence to the contrary, the burden is 
on the applicant to prove that the claimed method is different from those taught by the prior art 
and to establish patentable differences. See In re Best 562F.2d 1252, 195 USPQ 430 (CCPA). 

Claim Rejections - 35 USC §103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

The factual inquiries set forth in Graham v. John Deere Co., 383 U.S. 1, 148 USPQ 459 
(1966), that are applied for establishing a background for determining obviousness under 35 
U.S.C. 103(a) are summarized as follows: 

1 . Determining the scope and contents of the prior art. 

2. Ascertaining the differences between the prior art and the claims at issue. 
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3. Resolving the level of ordinary skill in the pertinent art. 

4. Considering objective evidence present in the application indicating obviousness 
or nonobviousness. 

This application currently names joint inventors. In considering patentability of the 
claims under 35 U.S.C. 103(a), the examiner presumes that the subject matter of the various 
claims was commonly owned at the time any inventions covered therein were made absent any 
evidence to the contrary. Applicant is advised of the obligation under 37 CFR 1 .56 to point out 
the inventor and invention dates of each claim that was not commonly owned at the time a later 
invention was made in order for the examiner to consider the applicability of 35 U.S.C. 103(c) 
and potential 35 U.S.C. 102(e), (f) or (g) prior art under 35 U.S.C. 103(a). 

9. Claim 8 is rejected under 35 U.S.C. 103(a) as being unpatentable over Tan et al 
(US Patent App. Publication 2005/0170351 Al, Feb. 20, 2003) as applied to claims 1, 3, 9, and 
66-69, 71, and 72 above, in view of Hung et al.(US Pat. No. 6,638, 727, January 26, 1999). 

Claim 8 is drawn to the method of claim 1, wherein the subject sample comprises nipple 
aspirate or ductal fluid obtained from the subject. 

Tan et al. teach as set forth above, but do not teach nipple aspirate or ductal fluid 
obtained from the subject. 

Hung et al teach that nipple aspirate fluid is a promising non-invasive method to identify 
cellular markers of breast cancer risk, see Col 2, line 60 to Col. 3, line 5. Hung et al teach using 
ductal fluid by nipple aspiration for the identification of the breast cancer cells, see col. 3 and 4. 

It would be prime facie obvious to one of skill in the art at the time the invention was 
made to use nipple aspirate or ductal fluid from the subject as the sample to assay the level of 
FLJ20174 nucleic acid because these are typical samples used in the art to screen for breast 
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cancer as taught by Hung et al. One of skill in the art would have been motivated with a 
reasonable expectation of success to use these types of samples because they were normally used 
in the art for breast cancer detection. 

10. No claims allowed. 

1 1 . Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to PETER J. REDDIG whose telephone number is (571)272-9031. 
The examiner can normally be reached on M-F 8:30 a.m. -5:00 p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Helms Larry can be reached on (57 1 ) 272-0832. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



/Peter J Reddig/ 
Examiner, Art Unit 1642 
/P. J. RJ 



Application/Control Number: 10/771,620 
Art Unit: 1643 



Page 1 1 



/Karen A Canella/ 

Primary Examiner, Art Unit 1643 



Appendix 1 



AK000181 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



PRI 12-SEP-2006 



TITLE 
JOURNAL 
REFERENCE 
AUTHORS 



Ota,T., Suzuki, Y. 
Nakamura, Y. , Isoi 
icing project 



Obayashi,M., Nishi,T., 
ai,T. and Sugano,S. 



Nishi,T., Isogai,T., 



AK000181 
Homo sapiens 
AK000181 

AK000181.1 GI:7020098 

FLI_C DNA; oligo capping; fis (full insert sequence) . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

1 

Kawabata,A., Hikiji,T., Kobatake,N., Inagaki,H., Ikema,Y., 
Okamoto,S., Okitani,R., 
Shibahara,T. , Tanaka,T., 
NEDO human cDNA sequeni 
Unpublished 
2 (bases 1 to 4669) 

Sugano,S., Suzuki, Y., Ota,T., Obayashi,M., 
Shibahara, T . , Tanaka,T. and Nakamura, Y. 
Direct Submission 

Submitted ( 15-FEB-2 000 ) Sumio Sugano, Institute of Medical Science, 
University of Tokyo, Deptment of Virology; Shirokane-dai, 4-6-1, 
Minato-ku, Tokyo 108-8639, Japan (E-mail : flcdnaS ims . u-tokyo . ac . jp, 
Tel: 81-3-54 4 9-528 6, Fax:81-3-5449-5416) 

NEDO human cDNA sequencing project supported by Ministry of 
International Trade and Industry of Japan; cDNA full insert 
sequencing: Research Association for Biotechnology; cDNA library 
construction, 5'- & 3 '-end one pass sequencing: Departent of 
Virology and Human Genome Center, Institute of Medical Science, 
University of Tokyo (partly supported by Science and Technology 
Agency) . 

Location/Qualifiers 
: 1. .4669 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/clone="COL09863" 

/ tissue_type="colon" 

/clone_lib="COL" 

/note="cloning vector pME18SFL3" 
377. .2860 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="BAA90994 .1" 
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/db_xref="GI : 7020099" 

/ trans lation="MRGCLRLALLCALPWLLLAASPGHPAKSPRQPPAPRRDPFDAAR 
GADFDHVYSGWNLSTENIYSFNYTSQPDQVTAVRVYVNSSSENLNYPVLWVRQQKE 
VLSWQVPLLFQGLYQRSYNYQEVSRTLCPSEATNETGPLQQLIFVDVASMAPLGAQYK 
LLVTKLKHFQLRTNVAFHFTASPSQPQYFLYKFPKDVDSVIIKVVSEMAYPCSVVSVQ 
NIMCPVYDLDHNVEFNGVYQSMTKKAAITLQKKDFPGEQFFVVFVIKPEDYACGGSFF 
IQEKENQTWNLQRKKNLEVTIVPSIKESVYVKSSLFSVFI FLSFYLGCLLVGFVHYLR 
FQRKSI DGSFGSNDGSGNMVASHPIAASTPEGSNYGTI DESSSS PGRQMS SS DGGPPG 
QSDTDSSVEESDFDTMPDIESDKNIIRTKMFLYLSDLSRKDRRIVSKKYKI YFWNI IT 
IAVFYALPVIQLVITYQTWNVTGNQDICYYNFLCAHPLGVLSAFNNILSNLGHVLLG 
FLFLLIVLRRDILHRRALEAKDIFAVEYGIPKHFGLFYAMGIALMMEGVLSACYHVCP 
NYSNFRFDTSFMYMIAGLCMLKLYQTRHPDINASAYSAYASFAVVIMVTVLGWFGKN 
DVWFWVIFSAIHVLASLALSTQIYYMGRFKIDLGIFRRAAMVFYTDCIQQCSRPLYMD 
RMVLLVVGNLVNWS FALFGLI YRPRDFAS YMLGI FI CNLLLYLAFY 1 1 MKLRS S EKVL 
PVPLFCIVATAVMWAAALYFFFQNLSSWEGTPAESREKNRECILLDFFDDHDIWHFLS 
ATALFFSFLVLLTLDDDLDWRRDQI PVF" 

ORIGIN 

Query Match 100.0%; Score 4669; DB 5; Length 4669; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 4669; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TTTTTATTATTGCTGTTATTGAGGTTGAGGGAGAAGAGATCGGTCTAAATTCTGGCTGGG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 TTTTTATTATTGCTGTTATTGAGGTTGAGGGAGAAGAGATCGGTCTAAATTCTGGCTGGG 60 

Qy 61 TAAGTGGGGGGATTCTCGGCGATGAGAAACGGGGGACTTAGAAGCCGGAGGAAAATCAGC 12 0 

Db 61 TAAGTGGGGGGATTCTCGGCGATGAGAAACGGGGGACTTAGAAGCCGGAGGAAAATCAGC 12 0 

Qy 121 AGCCCCACATCTCCACTTCTCCAGTCCGCCCTACTCTCCACCCGTGACCTCCAGTGGAGA 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 AGCCCCACATCTCCACTTCTCCAGTCCGCCCTACTCTCCACCCGTGACCTCCAGTGGAGA 180 

Qy 181 CCCCAGGCGGCAGCATCAGTATTTGATCGGCCCTTCGTCAGCACGCTGCCAGCCCTGGCC 24 0 

Db 181 CCCCAGGCGGCAGCATCAGTATTTGATCGGCCCTTCGTCAGCACGCTGCCAGCCCTGGCC 24 0 

Qy 241 GGCTGGGTTCGCCAGGCATCACCCGCTCGGCTCTGAAGCGGACGCCTGGCCCTGCACCGG 300 

Db 241 GGCTGGGTTCGCCAGGCATCACCCGCTCGGCTCTGAAGCGGACGCCTGGCCCTGCACCGG 300 

Qy 301 GCTTTGGAAGGACCCTCTCTGCGCTCGCCCCCTCCCCAGGGTGGCTCCGCTTTCGAGCCC 360 

Db 301 GCTTTGGAAGGACCCTCTCTGCGCTCGCCCCCTCCCCAGGGTGGCTCCGCTTTCGAGCCC 360 

Qy 3 61 GGGCGCGGCGCCCACCATGCGCGGCTGCCTGCGGCTCGCGCTGCTCTGCGCGCTGCCCTG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 61 GGGCGCGGCGCCCACCATGCGCGGCTGCCTGCGGCTCGCGCTGCTCTGCGCGCTGCCCTG 42 0 

Qy 421 GCTCCTGCTGGCGGCGTCGCCCGGGCACCCGGCGAAATCCCCCAGGCAGCCCCCGGCACC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GCTCCTGCTGGCGGCGTCGCCCGGGCACCCGGCGAAATCCCCCAGGCAGCCCCCGGCACC 480 

Qy 4 81 GCGCCGCGACCCCTTCGACGCTGCCAGGGGCGCCGATTTCGATCATGTCTACAGCGGGGT 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 81 GCGCCGCGACCCCTTCGACGCTGCCAGGGGCGCCGATTTCGATCATGTCTACAGCGGGGT 54 0 

Qy 541 GGTGAACCTCAGCACCGAGAACATCTACTCTTTCAACTACACCAGCCAGCCCGACCAGGT 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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541 GGTGAACCTCAGCACCGAGAACATCTACTCTTTCAACTACACCAGCCAGCCCGACCAGGT 600 

601 GACAGCCGTGAGGGTGTATGTGAACAGTTCCTCTGAGAATCTCAACTACCCGGTCCTTGT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
601 GACAGCCGTGAGGGTGTATGTGAACAGTTCCTCTGAGAATCTCAACTACCCGGTCCTTGT 660 

6 61 TGTGGTTCGCCAGCAGAAAGAGGTGCTGTCCTGGCAGGTTCCTCTGCTCTTCCAAGGACT 72 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 61 TGTGGTTCGCCAGCAGAAAGAGGTGCTGTCCTGGCAGGTTCCTCTGCTCTTCCAAGGACT 72 0 

7 21 ATACCAGAGGAGCTACAATTATCAAGAAGTGAGCCGCACCTTATGTCCCTCAGAAGCAAC 7 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 21 ATACCAGAGGAGCTACAATTATCAAGAAGTGAGCCGCACCTTATGTCCCTCAGAAGCAAC 7 80 

7 81 CAATGAGACGGGACCCTTGCAGCAACTGATATTTGTAGATGTCGCATCCATGGCACCCCT 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 81 CAATGAGACGGGACCCTTGCAGCAACTGATATTTGTAGATGTCGCATCCATGGCACCCCT 84 0 

841 GGGTGCTCAGTACAAACTGCTAGTTACCAAGCTGAAGCACTTCCAGCTCCGGACAAATGT 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
841 GGGTGCTCAGTACAAACTGCTAGTTACCAAGCTGAAGCACTTCCAGCTCCGGACAAATGT 900 

901 TGCCTTTCACTTTACTGCCAGCCCCTCTCAACCTCAGTATTTTCTATACAAGTTTCCCAA 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
901 TGCCTTTCACTTTACTGCCAGCCCCTCTCAACCTCAGTATTTTCTATACAAGTTTCCCAA 960 

9 61 AGACGTGGACTCAGTTATCATTAAAGTGGTGTCTGAAATGGCTTATCCATGTTCTGTTGT 1020 

9 61 AGACGTGGACTCAGTTATCATTAAAGTGGTGTCTGAAATGGCTTATCCATGTTCTGTTGT 1020 

1021 CTCAGTCCAGAATATCATGTGCCCGGTGTATGATCTCGACCACAATGTGGAATTTAATGG 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 CTCAGTCCAGAATATCATGTGCCCGGTGTATGATCTCGACCACAATGTGGAATTTAATGG 1080 

1081 TGTCTATCAGTCCATGACCAAGAAAGCTGCCATCACGCTACAGAAGAAGGATTTTCCAGG 1140 

1081 TGTCTATCAGTCCATGACCAAGAAAGCTGCCATCACGCTACAGAAGAAGGATTTTCCAGG 114 0 

1141 CGAGCAGTTCTTCGTGGTATTTGTGATAAAGCCTGAAGATTATGCCTGTGGAGGATCTTT 12 00 

1141 CGAGCAGTTCTTCGTGGTATTTGTGATAAAGCCTGAAGATTATGCCTGTGGAGGATCTTT 12 00 

12 01 CTTCATCCAGGAAAAGGAAAACCAGACCTGGAATCTACAGCGAAAAAAGAACCTTGAAGT 12 60 

12 01 CTTCATCCAGGAAAAGGAAAACCAGACCTGGAATCTACAGCGAAAAAAGAACCTTGAAGT 12 60 

12 61 GACCATTGTCCCTTCCATTAAAGAATCTGTTTATGTGAAATCCAGTCTTTTCAGTGTCTT 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 61 GACCATTGTCCCTTCCATTAAAGAATCTGTTTATGTGAAATCCAGTCTTTTCAGTGTCTT 1320 

1321 CATCTTCCTGTCCTTCTACTTGGGATGCCTTCTTGTTGGGTTTGTTCATTATCTGAGGTT 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1321 CATCTTCCTGTCCTTCTACTTGGGATGCCTTCTTGTTGGGTTTGTTCATTATCTGAGGTT 1380 

1381 TCAGAGAAAATCCATTGATGGAAGCTTTGGGTCCAATGATGGCTCTGGAAATATGGTGGC 14 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1381 TCAGAGAAAATCCATTGATGGAAGCTTTGGGTCCAATGATGGCTCTGGAAATATGGTGGC 14 4 0 

14 41 ATCTCATCCCATTGCTGCCAGCACACCCGAAGGGAGCAATTATGGGACAATAGATGAGTC 1500 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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14 41 ATCTCATCCCATTGCTGCCAGCACACCCGAAGGGAGCAATTATGGGACAATAGATGAGTC 1500 

1501 AAGCTCCAGTCCTGGAAGGCAGATGTCCTCCTCCGATGGTGGGCCACCGGGCCAGTCAGA 15 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1501 AAGCTCCAGTCCTGGAAGGCAGATGTCCTCCTCCGATGGTGGGCCACCGGGCCAGTCAGA 15 60 

15 61 CACAGACAGCTCCGTGGAGGAGAGCGACTTCGACACCATGCCAGACATTGAGAGTGATAA 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
15 61 CACAGACAGCTCCGTGGAGGAGAGCGACTTCGACACCATGCCAGACATTGAGAGTGATAA 1620 

1621 AAACATCATCCGGACCAAGATGTTCCTTTACCTGTCAGATTTGTCCAGGAAGGACCGGAG 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1621 AAACATCATCCGGACCAAGATGTTCCTTTACCTGTCAGATTTGTCCAGGAAGGACCGGAG 1680 

1681 AATTGTCAGCAAAAAATATAAAATTTATTTTTGGAACATCATCACCATTGCTGTGTTTTA 17 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1681 AATTGTCAGCAAAAAATATAAAATTTATTTTTGGAACATCATCACCATTGCTGTGTTTTA 17 4 0 

17 41 CGCGCTGCCCGTGATCCAGCTGGTCATTACCTATCAGACAGTGGTAAATGTCACTGGCAA 1800 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

17 41 CGCGCTGCCCGTGATCCAGCTGGTCATTACCTATCAGACAGTGGTAAATGTCACTGGCAA 1800 

1801 CCAGGACATCTGTTACTACAACTTCCTCTGTGCTCACCCCTTGGGCGTCCTGAGTGCCTT 18 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1801 CCAGGACATCTGTTACTACAACTTCCTCTGTGCTCACCCCTTGGGCGTCCTGAGTGCCTT 18 60 

18 61 CAACAACATTCTCAGCAATCTGGGCCACGTGCTTCTGGGCTTCCTCTTCCTGCTGATAGT 1920 

18 61 CAACAACATTCTCAGCAATCTGGGCCACGTGCTTCTGGGCTTCCTCTTCCTGCTGATAGT 1920 

1921 CTTGCGCCGCGACATCCTCCATCGGAGAGCCCTGGAAGCCAAGGACATCTTTGCTGTGGA 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1921 CTTGCGCCGCGACATCCTCCATCGGAGAGCCCTGGAAGCCAAGGACATCTTTGCTGTGGA 1980 

1981 GTACGGGATTCCCAAACACTTTGGTCTCTTCTACGCTATGGGCATTGCATTGATGATGGA 204 0 

1981 GTACGGGATTCCCAAACACTTTGGTCTCTTCTACGCTATGGGCATTGCATTGATGATGGA 204 0 

2041 AGGGGTGCTCAGTGCTTGCTACCATGTCTGCCCTAATTATTCCAACTTCCGATTCGACAC 2100 

2041 AGGGGTGCTCAGTGCTTGCTACCATGTCTGCCCTAATTATTCCAACTTCCGATTCGACAC 2100 

2101 CTCCTTCATGTACATGATCGCTGGCCTGTGCATGCTGAAGCTCTATCAGACCCGCCACCC 2160 

2101 CTCCTTCATGTACATGATCGCTGGCCTGTGCATGCTGAAGCTCTATCAGACCCGCCACCC 2160 

2161 AGACATCAATGCCAGCGCCTACTCTGCCTATGCCTCCTTTGCTGTGGTCATCATGGTCAC 2220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2161 AGACATCAATGCCAGCGCCTACTCTGCCTATGCCTCCTTTGCTGTGGTCATCATGGTCAC 2220 

2221 CGTCCTTGGAGTGGTGTTTGGAAAAAATGACGTATGGTTCTGGGTCATCTTCTCTGCAAT 22 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2221 CGTCCTTGGAGTGGTGTTTGGAAAAAATGACGTATGGTTCTGGGTCATCTTCTCTGCAAT 22 80 

22 81 CCACGTTCTGGCCTCGCTAGCCCTCAGCACCCAGATATATTATATGGGTCGTTTCAAGAT 2340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22 81 CCACGTTCTGGCCTCGCTAGCCCTCAGCACCCAGATATATTATATGGGTCGTTTCAAGAT 2340 

2341 AGATTTGGGAATTTTCCGGCGGGCTGCCATGGTGTTCTACACAGACTGTATCCAGCAGTG 24 00 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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2341 AGATTTGGGAATTTTCCGGCGGGCTGCCATGGTGTTCTACACAGACTGTATCCAGCAGTG 24 00 

24 01 TAGCCGACCTCTATATATGGATAGAATGGTGTTGCTGGTTGTGGGGAATCTGGTTAACTG 24 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

24 01 TAGCCGACCTCTATATATGGATAGAATGGTGTTGCTGGTTGTGGGGAATCTGGTTAACTG 24 60 

24 61 GTCCTTCGCCCTCTTTGGATTGATATACCGCCCCAGGGACTTTGCTTCCTACATGCTGGG 2520 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

24 61 GTCCTTCGCCCTCTTTGGATTGATATACCGCCCCAGGGACTTTGCTTCCTACATGCTGGG 2520 

2521 CATCTTCATCTGTAACCTTTTGCTGTACCTGGCCTTTTACATCATCATGAAGCTCCGCAG 2580 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2521 CATCTTCATCTGTAACCTTTTGCTGTACCTGGCCTTTTACATCATCATGAAGCTCCGCAG 2580 

2581 CTCTGAAAAGGTCCTCCCAGTCCCGCTCTTCTGCATCGTGGCCACCGCTGTGATGTGGGC 2 64 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2581 CTCTGAAAAGGTCCTCCCAGTCCCGCTCTTCTGCATCGTGGCCACCGCTGTGATGTGGGC 2 64 0 

2 641 TGCCGCCCTATATTTTTTCTTCCAGAATCTCAGCAGCTGGGAGGGAACTCCGGCCGAATC 27 00 



27 01 CCGGGAGAAGAACCGCGAGTGCATTCTGCTGGATTTCTTCGATGACCATGACATCTGGCA 27 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

27 01 CCGGGAGAAGAACCGCGAGTGCATTCTGCTGGATTTCTTCGATGACCATGACATCTGGCA 27 60 

27 61 CTTCCTCTCTGCTACTGCTCTGTTTTTCTCATTCTTGGTTTTGTTAACTTTGGATGATGA 2820 

27 61 CTTCCTCTCTGCTACTGCTCTGTTTTTCTCATTCTTGGTTTTGTTAACTTTGGATGATGA 2820 

2821 CCTTGATGTGGTTCGGAGAGACCAGATCCCTGTCTTCTGAACCTCCAACATTAAGAGAGG 2880 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2821 CCTTGATGTGGTTCGGAGAGACCAGATCCCTGTCTTCTGAACCTCCAACATTAAGAGAGG 2880 

2881 GGAGGGAGCGATCAATCTTGGTGCTGTTTCACAAAAATTACAGTGACCACAGCAAAGTAA 2 94 0 

2881 GGAGGGAGCGATCAATCTTGGTGCTGTTTCACAAAAATTACAGTGACCACAGCAAAGTAA 2 94 0 

2 941 CCACTGCCAGATGCTCCACTCACCCTCTGTAGAGCCAACTCTGCATTCACACAGGAAGGA 3000 

2 941 CCACTGCCAGATGCTCCACTCACCCTCTGTAGAGCCAACTCTGCATTCACACAGGAAGGA 3000 

3001 GAGGGGCTGCGGGAGATTTAAACCTGCAAGAAAGGAGGCAGAAGGGGAGCCATGTTTTGA 30 60 

3001 GAGGGGCTGCGGGAGATTTAAACCTGCAAGAAAGGAGGCAGAAGGGGAGCCATGTTTTGA 30 60 

30 61 GGACAGACGCAAACCTGAGGAGCTGAGAAACACTTGCTCCTTCCATCTGCAGCTTTGGGA 3120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

30 61 GGACAGACGCAAACCTGAGGAGCTGAGAAACACTTGCTCCTTCCATCTGCAGCTTTGGGA 3120 

3121 GTGCAACAGGGATAGGCACTGCATCCAAGTCAACTCACCATCTTGGGGTCCCTCCCACCC 3180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3121 GTGCAACAGGGATAGGCACTGCATCCAAGTCAACTCACCATCTTGGGGTCCCTCCCACCC 3180 

3181 TCACGGAGACTTGCCAGCAATGGCAGAATGCTGCTGCACACTTCCCTCCAGTTGTCACCC 32 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3181 TCACGGAGACTTGCCAGCAATGGCAGAATGCTGCTGCACACTTCCCTCCAGTTGTCACCC 32 4 0 

32 41 TGCCCAGAAAGGCCAGCAGCTTGGACTTCCTGCCCAGAAACTGTGTTGGCCCCCTTCACA 3300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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32 41 TGCCCAGAAAGGCCAGCAGCTTGGACTTCCTGCCCAGAAACTGTGTTGGCCCCCTTCACA 3300 

3301 CCTCTGCAACACCTGCTGCTCCAGCAAGAGGATGTGATTCTTTAGAATATGGCGGGGAGG 33 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3301 CCTCTGCAACACCTGCTGCTCCAGCAAGAGGATGTGATTCTTTAGAATATGGCGGGGAGG 33 60 

33 61 TGACCCCAGGCCCTGCCCTACTGGGATAGATGTTTTAATGGCACCAGCTAGTCACCTCCC 34 20 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

33 61 TGACCCCAGGCCCTGCCCTACTGGGATAGATGTTTTAATGGCACCAGCTAGTCACCTCCC 34 20 

34 21 AGAAGAAACTCTGTACATTTCCCCCAGGTTTCTGATGCCATCAGAAGGGCTCAGGAGTGG 34 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
34 21 AGAAGAAACTCTGTACATTTCCCCCAGGTTTCTGATGCCATCAGAAGGGCTCAGGAGTGG 34 80 

34 81 GGTTTGTCACACATTCCTCTTAACAAGTAACTGTCACTGGGACCGAGTCCTGGGTGCTTA 354 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
34 81 GGTTTGTCACACATTCCTCTTAACAAGTAACTGTCACTGGGACCGAGTCCTGGGTGCTTA 354 0 

3541 CATATTCCTTCGTGTCTTCATCTCACTGACCTGTGTGGACCTCATCACTCTGACTCTGCC 3600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3541 CATATTCCTTCGTGTCTTCATCTCACTGACCTGTGTGGACCTCATCACTCTGACTCTGCC 3600 

3601 TTCTTGGAAAGGCCCTGTCACTCCACAGATGTCTGGCCAGCTTCAAGGCAGAAGGAAAAA 36 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3601 TTCTTGGAAAGGCCCTGTCACTCCACAGATGTCTGGCCAGCTTCAAGGCAGAAGGAAAAA 36 60 

36 61 CAGGAAAAGCTCTTTTAACAGCAGCAGGAACAAGAGAAATGACTAACCATACTAAAAGAC 3720 

36 61 CAGGAAAAGCTCTTTTAACAGCAGCAGGAACAAGAGAAATGACTAACCATACTAAAAGAC 3720 

3721 TGGTAACAGCAGCAGCAGCCAGACAGGCCTCACCTTAAGGACTTGGGCTGCCAGAGCAAA 3780 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

37 21 TGGTAACAGCAGCAGCAGCCAGACAGGCCTCACCTTAAGGACTTGGGCTGCCAGAGCAAA 37 80 

37 81 TTCAGCAGAGCTTATTTGGCCTCCCATTCACACAGCTCAGTTCTGTGCCCACATCACCTT 384 0 

37 81 TTCAGCAGAGCTTATTTGGCCTCCCATTCACACAGCTCAGTTCTGTGCCCACATCACCTT 384 0 

3841 TGGGGAAGAAATCAGCATTCTAATCAGGGACACTACTTCAGGAGTCCTCCACAGCGAGTC 3900 

3841 TGGGGAAGAAATCAGCATTCTAATCAGGGACACTACTTCAGGAGTCCTCCACAGCGAGTC 3900 

3901 CGTCATCTGTCACTTTATGTAGATCAGGGTTCTAGACTTCTTCCCTGAGGTTCTCAGAAG 39 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3901 CGTCATCTGTCACTTTATGTAGATCAGGGTTCTAGACTTCTTCCCTGAGGTTCTCAGAAG 39 60 

39 61 CAGCTCTCAGGATGAACGTATTGTCCTCTTCCCCTCTTCTTGCAAAGTGCACAGCTAATC 4 020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
39 61 CAGCTCTCAGGATGAACGTATTGTCCTCTTCCCCTCTTCTTGCAAAGTGCACAGCTAATC 4 020 

4 021 TAATGTTGTCTCTCGGTTGCACCTGACATTCTCTCCCCAGTAAGGTGTTGGCAAGCTCAG 4 080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4 021 TAATGTTGTCTCTCGGTTGCACCTGACATTCTCTCCCCAGTAAGGTGTTGGCAAGCTCAG 4 080 

4 081 CATCTGGGTTCCACTCTCACACTGTCTGGCAGCTCTGTGTCTGAGAAGTTCTACATTGAC 414 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I 
4 081 CATCTGGGTTCCACTCTCACACTGTCTGGCAGCTCTGTGTCTGAGAAGTTCTACATTGAC 414 0 

4141 CAGGCCCCCTTGTTGCCTGGAGTATGACGTAATCAGAAAATAGACGTATAAATGTGCACA 42 00 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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4141 CAGGCCCCCTTGTTGCCTGGAGTATGACGTAATCAGAAAATAGACGTATAAATGTGCACA 42 00 

42 01 TGCGTATGTATTTGCTTGTGAAATTAAAGTCACCTCTTGCCTCTGCTTTCCTGATCATTC 42 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
42 01 TGCGTATGTATTTGCTTGTGAAATTAAAGTCACCTCTTGCCTCTGCTTTCCTGATCATTC 42 60 

42 61 GTTAGAGAAATGGATCAGGCATTTTTTTAAATTATTATTCTTTCTCTAAACTATTTGCAT 4 320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
42 61 GTTAGAGAAATGGATCAGGCATTTTTTTAAATTATTATTCTTTCTCTAAACTATTTGCAT 4 320 

4 321 TGTGTTCAAAAACCCATTTTAGAAGTTTGAACAGCAAGCTTTTCCTGATTTTAAAAACAC 4 380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4 321 TGTGTTCAAAAACCCATTTTAGAAGTTTGAACAGCAAGCTTTTCCTGATTTTAAAAACAC 4 380 

4 381 AAAGTTGCTTTCAATGAAATATTTTGTGATTTTTTTAAAGTCCCCAAATGTGTACTTAGC 4 4 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4 381 AAAGTTGCTTTCAATGAAATATTTTGTGATTTTTTTAAAGTCCCCAAATGTGTACTTAGC 4 4 4 0 

4 4 41 CTTCTGTTATTCCTTATTCTTTAAGCAGTGTTGGCTTCCATTGACCATATGAAGGCCACC 4 500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4 4 41 CTTCTGTTATTCCTTATTCTTTAAGCAGTGTTGGCTTCCATTGACCATATGAAGGCCACC 4 500 

4 501 AATTAAATGGTTGTGTTAATCCAACATGTAAAAAACTTTTTGGCAGGGCACAGTGGCTCA 4 5 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4501 AATTAAATGGTTGTGTTAATCCAACATGTAAAAAACTTTTTGGCAGGGCACAGTGGCTCA 45 60 

45 61 CGCCTGTAATCCCAAAGTGCTGGAATTTACAGGTAGGACCACCACACTTGGTCCACTTAC 4 620 

45 61 CGCCTGTAATCCCAAAGTGCTGGAATTTACAGGTAGGACCACCACACTTGGTCCACTTAC 4 620 

4 621 TTATAATAAACATTGATTTGGTCGTTAAAAAAAAAAAAAAAAAAAAAAA 4 669 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4 621 TTATAATAAACATTGATTTGGTCGTTAAAAAAAAAAAAAAAAAAAAAAA 4 66 9 
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2 (bases 1 to 4669) 

Sugano,S., Suzuki, Y., Ota,T., Obayashi,M., Nishi,T., Isogai,T., 
Shibahara, T . , Tanaka,T. and Nakamura,Y. 

Direct Submission 

Submitted (15-FEB-2000) Sumio Sugano, Institute of Medical 
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University of Tokyo, Deptment of Virology; Shirokane-dai , 4-6-1, 
Minato-ku, Tokyo 108-8639, Japan (E-mail : cdnal@ims . u-tokyo . ac . jp, 
Tel : 81-3-5449-52 86, Fax:81-3-544 9-5416) 
COMMENT NEDO human cDNA sequencing project supported by Ministry of 

International Trade and Industry of Japan; cDNA full insert 
sequencing: Research Association for Biotechnology; cDNA library 
construction, 5'- & 3 ' -end one pass sequencing: Departent of 
Virology and Human Genome Center, Institute of Medical Science, 
University of Tokyo (partly supported by Science and Technology 
Agency) . 

FEATURES Location/Qualifiers 
source 1 . .4669 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db xref="taxon: 9606 " 

/clone="COL09863" 

/ tissue_type="colon" 

/clone_lib="COL" 

/note="cloning vector pME18SFL3" 
CDS 377. .2860 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="' ^ . " 
/db xref="GI: 7020099" 



/ trans lation="MRGCLRLALLCALPWLLLAASPGHPAKSPRQPPAPRRDPFDAAR 



GADFDHVYSGWNLSTENIYSFNYTSQPDQVTAVRVYVNSSSENLNYPVLVWRQQKE 
VLSWQVPLLFQGLYQRSYNYQEVSRTLCPSEATNETGPLQQLIFVDVASMAPLGAQYK 
LLVTKLKHFQLRTNVAFHFTASPSQPQYFLYKFPKDVDSVIIKWSEMAYPCSWSVQ 
NIMC PVYDLDHNVEFNGVYQSMTKKAAITLQKKDFPGEQFFWFVIKPEDYACGGSFF 
IQEKENQTWNLQRKKNLEVTIVPSIKESVYVKSSLFSVFIFLSFYLGCLLVGFVHYLR 
FQRKSIDGSFGSNDGSGNMVASHPIAASTPEGSNYGTIDESSSSPGRQMSSSDGGPPG 
QSDTDSSVEESDFDTMPDIESDKNI IRTKMFLYLSDLSRKDRRIVSKKYKIYFWNI IT 
IAVFYALPVIQLVITYQTWNVTGNQDICYYNFLCAHPLGVLSAFNNILSNLGHVLLG 
FLFLLIVLRRDILHRRALEAKDIFAVEYGIPKHFGLFYAMGIALMMEGVLSACYHVCP 



NYSNFRFDTSFMYMIAGLCMLKLYQTRHPDINASAYSAYASFAWIMVTVLGWFGKN 
DVWFWVIFSAIHVLASLALSTQIYYMGRFKIDLGIFRRAAMVFYTDCIQQCSRPLYMD 
RMVLLWGNLVNWSFALFGLIYRPRDFASYMLGIFICNLLLYLAFYIIMKLRSSEKVL 



PVPLFCIVATAVMWAAALYFFFQNLSSWEGTPAESREKNRECILLDFFDDHDIWHFLS 
ATALFFSFLVLLTLDDDLDWRRDQIPVF" 

ORIGIN 



Application/Control Number: 10/771,620 
Art Unit: 1643 



Page 19 



1 tttttattat tgctgttatt gaggttgagg gagaagagat cggtctaaat tctggctggg 

61 taagtggggg gattctcggc gatgagaaac gggggactta gaagccggag gaaaatcagc 

121 agccccacat ctccacttct ccagtccgcc ctactctcca cccgtgacct ccagtggaga 

181 ccccaggcgg cagcatcagt atttgatcgg cccttcgtca gcacgctgcc agccctggcc 

241 ggctgggttc gccaggcatc acccgctcgg ctctgaagcg gacgcctggc cctgcaccgg 

301 gctttggaag gaccctctct gcgctcgccc cctccccagg gtggctccgc tttcgagccc 

361 gggcgcggcg cccaccatgc gcggctgcct gcggctcgcg ctgctctgcg cgctgccctg 

421 gctcctgctg gcggcgtcgc ccgggcaccc ggcgaaatcc cccaggcagc ccccggcacc 

481 gcgccgcgac cccttcgacg ctgccagggg cgccgatttc gatcatgtct acagcggggt 

541 ggtgaacctc agcaccgaga acatctactc tttcaactac accagccagc ccgaccaggt 

601 gacagccgtg agggtgtatg tgaacagttc ctctgagaat ctcaactacc cggtccttgt 

661 tgtggttcgc cagcagaaag aggtgctgtc ctggcaggtt cctctgctct tccaaggact 

721 ataccagagg agctacaatt atcaagaagt gagccgcacc ttatgtccct cagaagcaac 

781 caatgagacg ggacccttgc agcaactgat atttgtagat gtcgcatcca tggcacccct 

841 gggtgctcag tacaaactgc tagttaccaa gctgaagcac ttccagctcc ggacaaatgt 

901 tgcctttcac tttactgcca gcccctctca acctcagtat tttctataca agtttcccaa 

961 agacgtggac tcagttatca ttaaagtggt gtctgaaatg gcttatccat gttctgttgt 

1021 ctcagtccag aatatcatgt gcccggtgta tgatctcgac cacaatgtgg aatttaatgg 

1081 tgtctatcag tccatgacca agaaagctgc catcacgcta cagaagaagg attttccagg 

1141 cgagcagttc ttcgtggtat ttgtgataaa gcctgaagat tatgcctgtg gaggatcttt 

1201 cttcatccag gaaaaggaaa accagacctg gaatctacag cgaaaaaaga accttgaagt 

1261 gaccattgtc ccttccatta aagaatctgt ttatgtgaaa tccagtcttt tcagtgtctt 

1321 catcttcctg tccttctact tgggatgcct tcttgttggg tttgttcatt atctgaggtt 

1381 tcagagaaaa tccattgatg gaagctttgg gtccaatgat ggctctggaa atatggtggc 

1441 atctcatccc attgctgcca gcacacccga agggagcaat tatgggacaa tagatgagtc 

1501 aagctccagt cctggaaggc agatgtcctc ctccgatggt gggccaccgg gccagtcaga 

1561 cacagacagc tccgtggagg agagcgactt cgacaccatg ccagacattg agagtgataa 

1621 aaacatcatc cggaccaaga tgttccttta cctgtcagat ttgtccagga aggaccggag 

1681 aattgtcagc aaaaaatata aaatttattt ttggaacatc atcaccattg ctgtgtttta 

1741 cgcgctgccc gtgatccagc tggtcattac ctatcagaca gtggtaaatg tcactggcaa 

1801 ccaggacatc tgttactaca acttcctctg tgctcacccc ttgggcgtcc tgagtgcctt 

1861 caacaacatt ctcagcaatc tgggccacgt gcttctgggc ttcctcttcc tgctgatagt 

1921 cttgcgccgc gacatcctcc atcggagagc cctggaagcc aaggacatct ttgctgtgga 

1981 gtacgggatt cccaaacact ttggtctctt ctacgctatg ggcattgcat tgatgatgga 

2041 aggggtgctc agtgcttgct accatgtctg ccctaattat tccaacttcc gattcgacac 

2101 ctccttcatg tacatgatcg ctggcctgtg catgctgaag ctctatcaga cccgccaccc 

2161 agacatcaat gccagcgcct actctgccta tgcctccttt gctgtggtca tcatggtcac 

2221 cgtccttgga gtggtgtttg gaaaaaatga cgtatggttc tgggtcatct tctctgcaat 

2281 ccacgttctg gcctcgctag ccctcagcac ccagatatat tatatgggtc gtttcaagat 

2341 agatttggga attttccggc gggctgccat ggtgttctac acagactgta tccagcagtg 

2401 tagccgacct ctatatatgg atagaatggt gttgctggtt gtggggaatc tggttaactg 

2461 gtccttcgcc ctctttggat tgatataccg ccccagggac tttgcttcct acatgctggg 

2521 catcttcatc tgtaaccttt tgctgtacct ggccttttac atcatcatga agctccgcag 

2581 ctctgaaaag gtcctcccag tcccgctctt ctgcatcgtg gccaccgctg tgatgtgggc 

2641 tgccgcccta tattttttct tccagaatct cagcagctgg gagggaactc cggccgaatc 

2701 ccgggagaag aaccgcgagt gcattctgct ggatttcttc gatgaccatg acatctggca 

2761 cttcctctct gctactgctc tgtttttctc attcttggtt ttgttaactt tggatgatga 

2821 ccttgatgtg gttcggagag accagatccc tgtcttctga acctccaaca ttaagagagg 

2881 ggagggagcg atcaatcttg gtgctgtttc acaaaaatta cagtgaccac agcaaagtaa 

2941 ccactgccag atgctccact caccctctgt agagccaact ctgcattcac acaggaagga 

3001 gaggggctgc gggagattta aacctgcaag aaaggaggca gaaggggagc catgttttga 

3061 ggacagacgc aaacctgagg agctgagaaa cacttgctcc ttccatctgc agctttggga 

3121 gtgcaacagg gataggcact gcatccaagt caactcacca tcttggggtc cctcccaccc 

3181 tcacggagac ttgccagcaa tggcagaatg ctgctgcaca cttccctcca gttgtcaccc 
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3241 tgcccagaaa ggccagcagc ttggacttcc tgcccagaaa ctgtgttggc ccccttcaca 

3301 cctctgcaac acctgctgct ccagcaagag gatgtgattc tttagaatat ggcggggagg 

3361 tgaccccagg ccctgcccta ctgggataga tgttttaatg gcaccagcta gtcacctccc 

3421 agaagaaact ctgtacattt cccccaggtt tctgatgcca tcagaagggc tcaggagtgg 

3481 ggtttgtcac acattcctct taacaagtaa ctgtcactgg gaccgagtcc tgggtgctta 

3541 catattcctt cgtgtcttca tctcactgac ctgtgtggac ctcatcactc tgactctgcc 

3601 ttcttggaaa ggccctgtca ctccacagat gtctggccag cttcaaggca gaaggaaaaa 

3661 caggaaaagc tcttttaaca gcagcaggaa caagagaaat gactaaccat actaaaagac 

3721 tggtaacagc agcagcagcc agacaggcct caccttaagg acttgggctg ccagagcaaa 

3781 ttcagcagag cttatttggc ctcccattca cacagctcag ttctgtgccc acatcacctt 

3841 tggggaagaa atcagcattc taatcaggga cactacttca ggagtcctcc acagcgagtc 

3901 cgtcatctgt cactttatgt agatcagggt tctagacttc ttccctgagg ttctcagaag 

3961 cagctctcag gatgaacgta ttgtcctctt cccctcttct tgcaaagtgc acagctaatc 

4021 taatgttgtc tctcggttgc acctgacatt ctctccccaq taaggtgttg gcaagctcag 

4081 catctgggtt ccactctcac actgtctggc agctctgtgt ctgagaagtt ctacattgac 

4141 caggccccct tgttgcctgg agtatgacgt aatcagaaaa tagacgtata aatgtgcaca 

4201 tgcgtatgta tttgcttgtg aaattaaagt cacctcttgc ctctgctttc ctgatcattc 

4261 gttagagaaa tggatcaggc atttttttaa attattattc tttctctaaa ctatttgcat 

4321 tgtgttcaaa aacccatttt agaagtttga acagcaagct tttcctgatt ttaaaaacac 

4381 aaagttgctt tcaatgaaat attttgtgat ttttttaaag tccccaaatg tgtacttagc 

4441 cttctgttat tccttattct ttaagcagtg ttggcttcca ttgaccatat gaaggccacc 

4501 aattaaatgg ttgtgttaat ccaacatgta aaaaactttt tggcagggca cagtggctca 

4561 cgcctgtaat cccaaagtgc tggaatttac aggtaggacc accacacttg gtccacttac 

4621 ttataataaa cattgatttg gtcgttaaaa aaaaaaaaaa aaaaaaaaa 



